Multi-layer structure MLLR adaptation algorithm with subspace regression classes and tying
نویسندگان
چکیده
MLLR is a parameter transformation technique for both speaker and environment adaptation. When the amount of adaptation data is scarce, it is necessary to do adaptation with regression classes. In this paper, we present a rapid MLLR adaptation algorithm, which is called Multi-layer structure MLLR adaptation with subspace regression classes and tying (SRCMLR). The method groups the Gaussians on a finer acoustic subspace level. The motivation is that clustering at subspaces of lower dimensions results in lower distortion, and there are fewer parameters to be estimated for the subsequent MLLR transformation matrix. On the other hand, the multi-layer structure generates a regression class dynamically for each subspace using the outcome of the former MLLR transformation. By using the transform structure, computation load in performing transformation is much reduced. Experiments in large vocabulary mandarin speech recognition show the advantages of SRCMLLR over the traditional MLLR while the amount adaptation data is scarce.
منابع مشابه
A novel target-driven MLLR adaptation algorithm with multi-layer structure
This paper presents a novel target-driven MLLR adaptation algorithm with multiply layer structure, which is based on the thorough analysis of MLLR using the generation of regression class trees. The new algorithm is constructed on the targetdriven principal. It generates the regression class dynamically, basing on the outcome of the former MLLR transformation. The regression classes is defined ...
متن کاملA Novel Target-driven Mllr Adapatation Algorithm with Multi-layer Structure
This paper presents a novel target-driven MLLR adaptation algorithm with multiply layer structure, which is based on the thorough analysis of MLLR using the generation of regression class trees. The new algorithm is constructed on the targetdriven principal. It generates the regression class dynamically, basing on the outcome of the former MLLR transformation. The regression classes is defined ...
متن کاملRapid speaker adaptation using MLLR and subspace regression classes
In recent years, various adaptation techniques for hidden Markov modeling with mixture Gaussians have been proposed, most notably MAP estimation and MLLR transformation. When the amount of adaptation data is limited, adaptation can be done by grouping similar Gaussians together to form regression classes and then transforming the Gaussians in groups. The grouping of Gaussians is often determine...
متن کاملTransformation Sharing Strategies for MLLR Speaker Adaptation
Transformation Sharing Strategies for MLLR Speaker Adaptation Arindam Mandal Chair of the Supervisory Committee: Professor Mari Ostendorf Electrical Engineering Maximum Likelihood Linear Regression (MLLR) estimates linear transformations of automatic speech recognition (ASR) parameters and has achieved significant performance improvements in speaker-independent ASR systems by adapting to target...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004